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ABSTRACT 



A method of cache testing and fault correction is imple- 
mented subsequent to wafer dicing. Cache testing is moved 
from wafer level to the built-in self tes t (BIST) at machine 
level. The BIST is utilized along with cache redundancy for 
fault correction. The processor initiates a cache line test 
usi ng BIST upon power-up. When the processor is powered 
up and the test mode pins are set for the array test, the array 
BIST test begins. The BIST traverses the array_an_d tests each 
word line for hardware faults. Upon detection of a fault, the 
current address is stored in one of N fault address registers 
contained in the processor. These fault address registers are 
used to address redundant cache lines and therefore act as 
"soft" fuses. The entire cache structure is traversed in this 
manner with the addresses of any line faults being stored. If 
the number of found faults, indicated by stored addresses, 
are less than the number of redundant fault lines, then the 
processor self test will proceed to the next test. However, if 
the number of faults exceed the maximum number N of 
redundant fault registers, then an error condition will be 
broadcast to either a set of pins or registers, and the cache 
will not be allowed to be enabled. 

12 Claims, 3 Drawing Sheets 
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SOFT FUSES USING BIST FOR CACHE SELF FIG. 3 is a block diagram showing a processor with guard 

TEST band hardware for BIST for cache array testing. 

BACKGROUND OF THE INVENTION DETAILED DESCRIPTION OF A PREFERRED 

5 EMBODIMENT OF THE INVENTION 

1. Field of the Invention J 

The present invention generally relates to cache memories J^T 8 ^ ^T^' * ad ™ ot l 

(or caches) implemented in very large scale integrated " G J > * here 13 f ow ° a J cache m » cr ° U . whl <* stores 

(VLSO circuits and, more particularly, to testing and fault * a ' a and . ^undant cache 12 « used to replace 

correction of cache arrays. in d ' fe f ve , ^ >n the cache 11. The budt-m self test (BIST) 

_ ; . 10 checker 13, which is part of a processor (not shown), scans 

2. Background Description ^ lines k cachc u {Q delect afly defectivc ^ The 

With larger caches, significant yield impacts can occur addresses of the defective lines are stored in registers in the 

due to bad word lines within caches. Often, greater than fifty processor, and when a stored address matches on a cache hit, 

per cent of the die is made up of memory cells. Any hard or a redundant line from the redundant cache 12 is substituted, 

soft failures within the memory cells can render the part 15 The outputs of the cache 11 and the redundant cache 12 are 

non-functional and, therefore, scrap. One method of reduc- combined in multiplexer 14. 

ing scrap has been to add redundant cache lines and use an , n opcration , once me cachc u is enabled> me processor 

array test to isolate bad word hues. Once the bad word line ^ access the cache b providing ^ addrcss m6Q ^ In 

* isolated a replacement lme is created by blowing a set of paralle , to lhe cache access> tne address index fc cfaecked 

fuses for that particular word lme Then, when the part is 20 a inst one of N ^ored redu ndant address registers to 

functioning and the cache is indexed with the original determine whether or nol the Unf . * a previous i y identified 

address, the redundant lme is used instead of the bad lme. bad word Une If the nne ^ not a match tQ ^ slQred 

Unfortunately, the array test and fuse blowing can only be redundant address, then the data is provided by the main 

performed at wafer test; that is, before the wafer is diced into cac h e structure 11 via multiplexer 14. If a match exists, then 

individual chips. This added test adds expense and com- 25 me data p rov ided by the redundant line in the redundant 

plexity that may be unacceptable for some production hard- cac he 12, also via multiplexer 14. If the operation is a write, 

ware. In addition, if the part deteriorates in the field, any then the cache line will be updated along with the redundant 

future failures cannot be detected and corrected, since the location if there is an address match, 

test will never be performed again. Due to these problems, Buih ^ the redundant rcg isters 12 is the same tag 

an alternative approach to array cache testing is needed. 30 structure and least recently (lru) algorithm as that in 

SUMMARY OF THE INVENTION tne cache slructure 11 - witn mis feature, the processor BIST 

w ^*. mechanism determines the bad cache fines, and through 

It is therefore an object of the present invention to provide registers and redundant tag/word lines replaces any bad lines 

a method of cache testing and fault correction which can be detected at every power up of the system, 

implemented subsequent to wafer dicing. 35 with this invention, no longer will valuable wafer test 

According to the invention, cache testing is moved from time, and expensive fuse blow equipment be required. In 
wafer level to the built-in self test (BIST) at machine level. addition, if a hard failure manifests itself after final chip test, 
The BIST is utilized along with cache redundancy for fault the failure can still be detected and corrected at the system 
correction. The processor initiates a cache line test using ^ level to yield functioning hardware. 
BIST upon power-up. When the processor is powered up and The invention may be implemented in a comprehensive 
the test mode pins are set for the array test, the array BIST array test program including wafer level testing as shown in 
test begins. The BIST traverses the array and tests each word FIG. 2. Cache arrays are tested at wafer level in function 
line for hardware faults. Upon detection of a fault, the block 21. The wafers that pass are processed through final 
current address is stored in one of N fault address registers 45 metalization in function block 22. The wafer is then diced to 
contained in the processor. These fault address registers are produce individual chips. The chips are used to build mod- 
used to address redundant cache lines and therefore act as u les in function block 23. The built modules are tested, and 
"soft" fuses. The entire cache structure is traversed in this those that are fixable by blowing fuses, are fixed in function 
manner with the addresses of any line faults being stored. If block 24. The process to this point is conventional. The 
the number of found faults, indicated by stored addresses, 5Q invention goes beyond this conventional process by imple- 
are less than the number of redundant fault lines, then the menting the array test in the built-in self test (BIST) of the 
processor self test will proceed to the next test. However, if processor. 

the number of faults exceed the maximum number N of At p0Wer . U p, th e processor initiates built-in self test 

redundant fault registers, then an error condition will be ^ {ST) ^ fu^on block 25. This BIST, however, imple- 

broadcast to either a set of pins or registers, and the cache 5S ments the array tcst describc d above. As part of the test, 

will not be allowed to be enabled. redundant registers in the processor are set in function block 

BRIEF DESCRIPTION OF THE DRAWINGS registers are loaded with the addresses of cache 

lines detected as faulty during BIST. These registers act as 

The foregoing and other objects, aspects and advantages soft fuses effectively substituting a redundant cache line for 

will be better understood from the following detailed 60 a detected faulty cache line. This test is performed at every 

description of a preferred embodiment of the invention with power-up so that random or hard failures that occur in the 

reference to the drawings, in which: fi e jd are detected and fixed with this soft fuse technique. 

FIG. 1 is a block diagram showing the general architec- Using this technique of BIST, a further enhancement can 

ture of a cache macro with redundant cache and BIST be made if the number of failures exceed N redundant 

checker; 65 register locations. Once the N number of redundant loca- 

F1G. 2 is a flow diagram showing the cache line checking lions have been filled with bad line locations, the cache error 

process implemented by the invention; and status would have to be declared to the system. 
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Subsequently, as indicated in function block 27, the bad if the part were slowing down due to a higher operating 
cache portion up to the word line width divided by the temperature. Once again, the BIST testing would occur for 
architectural word width, along with the number of bits are the cache, and upon completion, the voltage would be 
devoted to line marking. In the maximum case, each arc hi- returned to the normal operating range, 
tectural word width could be marked with a functional bit 5 a third approach for guard banding the soft fuse BIST 
depending upon the results of the BIST, as indicated in wou jd be to control via an I/O port the external cooling, and 
function block 28. If the N locations have been previously faus a u ow ^ part t0 ramp up t0 tne required high tempera- 
allocated to other bad lines, then the BIST program would ture functional range. Using an onboard thermocouple, the 
mark the additional failures with a non-functional value. BIST test would be run at the desired temperature range. 

Alternatively to adding bits to the cache to mark bad word 10 Due to the thermal mass of the chip, this test could be 

lines, a small register file could be used to store the word line completed within a thermal ramp window. The BIST would 

addresses of the bad word lines in excess of the ones that can mark the bad location, and immediately re -start the cooling 

be fixed by redundancy alone. The TAG and cache line fan via a second I/O write. 

replacement logic would query this small register file to Any or all combinations of the above guard band tech- 
ensure those bad cache lines were not used. « mques would f su ^ T en h ance the ability of the BIST tech- 

Once the processor cache is enabled, the non-functional nique to find cache line faults, 

locations will reduce the total number of cache locations An additional advantage of using the this method is the 

within the processor. Since most caches have multiple reductionof wafer test time. At wafer test, the BIST test can 

associativity (4-way, 8-way, etc.), then potentially, one of the test up to a predefined ratio of good sites versus bad sites. If 

"ways" for a single set of cache lines could not be included fo c rat j 0 ^ i ess lnen the maximum ratio, then the wafer test 

in the base set due to the non-functionality discovered can stop the cache testing resulting in dramatic reduction of 

during BIST. Due to the large number of cache lines, and the test time. The ability of shortening the wafer test time is a 

small number of bad word lines (beyond the already fixed unique result of the soft fuse ability. At module test, the 

lines that use redundant address locations) the overall impact entire cache test would be run to ensure at product ship the 

to the system performance would be minimal. If greater than cache was 100% fixable. 

X locations beyond I the ; maximum word lines was reached of whHe the inyention has been dGSCribed in terms of a 

bad locations, the BIST could notify the system via pins or ^ embodiment, those skilled in the art will recognize 

registers that the cache was bad, and would prevent any ^ me mveQtion m ^ ^ ^ modification wi|hin 

enabling of the cache. ^ (he spirft and of ^ appended claims 

Given the fact that on-chip caches are increasing in size Having thus descri bed our invention, what we claim as 

rapidly, it may be more cost effective not to have any word Qew desire to secure by letters patent is as follows: 

line redundancy, but only this small table or bad word line L A method of array cache testing implemented in a 

addresses. In this way, all chips are not penalize with a 3 to buil t_in ^lf test of a processor, the cache including cache 

5% cache area increase for redundancy at the expense of 35 redundancy for fault correction, the method comprising the 

minimally affecting overall system performance. s t eps 0 f : 

Another enhancement to the BIST power-up array test initiating cache line testing as part of the built-in self test 

with redundant registers would be to incorporate additional upon pr0 cessor power-up* 

guard band during the test. The intent of the guard band (raversi ^ and ^ Mch WOfd ^ fof hafd . 

would be to attempt to capture any cache lines that may be 40 ware 

cycle time dependent. A particular cache line may pass the „ „ , , , . 

test upon the initial BIST testing. As the hardware heats up, upon detect™ ' f * fault, storing a current address in one 

the cache unit begins to slow down. If the failure mechanism of N fau,t address re 8 istcfs stained m the processor; 
only shows up at the slower speed, then a fail escape could 

exist even after the BIST test Consequently, one approach 45 ^"g lhe fault addre *s registers to address redundant 

would be to guard band the BIST test with frequency cache ^ durin g subsequent operation of the proces- 

margin. sor - 

The hardware for guard banding is shown in FIG. 3. The fi The ™*° d °* ' ™Y cache testing recited in claim 1 

processor 30 includes a cache array 31, redundant cache 32 ^ TihcT com V r ™2 lhe ste P s ° f - 

and BIST 33, substantially as shown in FIG. 1. A clock 50 comparing a number of detected faults to N; and 

generator 35 generates clock signals to the BIST 33 and if the number of detected faults exceeds N, issuing a cache 

supplies clock pulses to a clock chopper 36 that drives the fault indication. 

cache array 31. During the BIST portion of the cache testing, 3. The method of array cache testing recited in claim 2 

the cache 31 is stressed by giving it a shorter cycle time. wherein if the number of detected faults exceeds N, further 

Since the required increase in frequency does not need to be 55 comprising the step of inhibiting the cache array from being 

very much to compensate for a temperature increase, a clock enabled. 

chopper 36 is used to clock a small delay to the read clock 4. The method of array cache testing recited in claim 2 

and write clock when a cache access is initiated, thereby further comprising the step of marking bad word lines in 

reducing the amount or time the cache 31 has to output its excess of N. 

data to the rest of the chip whose clocks have not been 6 q The method of array cache testing recited in claim 4 

altered. The amount of delay required to give the appropriate further comprising the steps of: 

guard band is determined by circuit simulation. The cache storing addresses of marked bad word lines in a register; 

recovery mechanism works the same as that described and 

earlier, querying the register prior to accessing the cache to insure 

The second guard band approach is to run the BIST cache 65 that bad word lines are not used, 

test at a lower operating voltage than the standard operating 6. The method of array cache testing recited in claim 1 

condition. The effect of the lower voltage would be the same wherein the cache has multiple associativity, further com- 
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prising the step of inhibiting one of the multiple associativity 
ways of the cache to exclude non-functional word lines in 
excess of N. 

7. The method of array cache testing recited in claim 1 
wherein the cache line testing of the built-in self test 
includes the step of guard banding the cache airay by testing 
at a different frequency. 

8. A processor including an array cache including cache 
redundancy for fault correction, the processor comprising: 

testing means for initiating cache line testing as part of a 
built-in self test upon processor power-up, said cache 
line testing traversing the array and testing each word 
line for hardware faults; 

detecting means responsive to the testing means for 
detecting a fault and storing a current address in one of 
N fault address registers contained in the processor; and 

cache accessing means for using the fault address regis- 
ters to address redundant cache lines during subsequent 
operation of the processor. 



10 



15 



9. The processor recited in claim 8 further comprising 
comparing means responsive to the detecting means for 
comparing a number of detected faults to N and, if the 
number of detected faults exceeds N, issuing a cache fault 
indication. 

10. The processor recited in claim 9 further comprising a 
register, the comparing means marking bad word lines in 
excess of N in the register, and the cache accessing means 
querying the register prior to accessing the cache to insure 
that bad word lines are not used. 

11. The processor recited in claim 8 wherein the cache has 
multiple associativity, further comprising means for inhib- 
iting one of the multiple associativity ways of the cache to 
exclude non-functional word lines in excess of N. 

12. The processor recited in claim 8 wherein the testing 
means includes a variable frequency clock generator for 
guard banding the cache array by testing at a different 
frequency. 
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